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ABSTRACT 

In a previous paper we investigated means for constraining the mass distribution of 
cluster galaxies by weak lensing. We concluded that a comprehensive method should 
treat the lensing effects of individual cluster galaxies and those resulting from a general 
cluster component simultaneously. To this end we now develop a non-parametric max- 
imum likelihood cluster reconstruction algorithm that can implicitly take into account 
the presence of cluster galaxies. The method includes an entropy-like regularization 
prescription and directly uses the ellipticities of individual source galaxy images as 
observables rather than relying on an averaged ellipticity field. The mass distribution 
of cluster galaxies is described by parametrized models. For each set of galaxy param- 
eters the cluster reconstruction algorithm allows to determine the best representation 
of the global underlying cluster component that is consistent with the presence of the 
cluster galaxies and the observed image ellipticities of background galaxies. Tests with 
simulations yielded convincing and robust results. 

We applied the method to a WFPC2 image of the cluster C10939+47I3 and ob- 
tained a detection of the lensing effects of luminous elliptical cluster galaxies. We 
consider this application as a successful test of our technique. However, the small size 
of the image we analysed does not yet allow to draw strong conclusions regarding the 
mass distribution of cluster galaxies. 

Key words: galaxies: clusters: general - galaxies: clusters: individual: C10939+4713 
- galaxies: haloes - dark matter - gravitational lensing. 



1 INTRODUCTION 

Weak gravitational lensing was suggested by Tyson et al. 
(1984) as a means to obtain information on the mass distri- 
bution of galaxies at large radial distances. The first detec- 
tion of weak distortions of the images of background galax- 
ies generated by the tidal gravitational field of interven- 
ing galaxies was reported by Brainerd, Blandford & Smail 
(1996). Griffiths et al. (1996) studied the galaxy-galaxy lens- 
ing effects in the Medium Deep Survey, and Dell' Antonio & 
Tyson (1996) and Hudson et al. (1997) analysed the Hub- 
ble Deep Field. Natarajan et al. (1997) detected the weak 
lensing effects induced by cluster galaxies in a mosaic of 
HST-images of the cluster AC114. 

In this paper we continue our discussion of methods to 
constrain the mass distribution of cluster galaxies from weak 
lensing presented in Geiger & Schneider (1998, hereafter re- 
ferred to as Paper I). For the maximum likelihood method 
described in that paper we first obtained a reconstruction 
of the cluster mass distribution using 'conventional' tech- 
niques that rely on the inversion of an integral equation. 



Then we added parametrized models for the mass distribu- 
tion of galaxies to the cluster component and applied empir- 
ical mass subtraction prescriptions to keep the total mass of 
the system constant. This approach proved to be workable 
in the outskirts of clusters where the surface mass density is 
low. In the non-linear lensing regime in the cluster centre, 
however, the result turned out to be sensitive to the details 
of the mass reconstruction and subtraction procedures. For 
values of the galaxy model parameters that imply a very 
large fraction of the total mass attributed to galaxies, con- 
ceptional problems arise regardless of the (non-)linearity of 
the lensing effects or the distance from the cluster centre. 

In order to overcome these problems we decided to ex- 
tend the maximum likelihood approach to the problem of 
determining the mass distribution of the cluster component. 
This allows to include prior information on the presence of 
the cluster galaxies directly into the cluster reconstruction 
procedure. We follow the work of Bartelmann et al. (1996) 
and use the two-dimensional gravitational potential as the 
fundamental physical quantity describing the properties of 
the cluster lens. Likelihood methods for cluster reconstruc- 
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tion from weak lensing were also described by Squires & 
Kaiser (1996) and Bridle et al. (1998). These methods use 
a grid of averaged image ellipticities of background galaxies 
as observational input. However, the averaging procedure al- 
ready destroys the information on galaxy scales that we are 
interested in and therefore we are obliged to use the elliptic- 
ities of individual source galaxy images directly as observ- 
ables. Such an approach for cluster reconstruction was inves- 
tigated in detail by Seitz, Schneider & Bartelmann (1998), 
and some elements of the technical implementation of our 
method are similar to their study. 

In Section we describe the combined method for clus- 
ter lens reconstruction and galaxy-galaxy lensing that we 
have developed. The reliability of the method is confirmed 
by an application to simulated observations presented in 
Section In Section ^ we use our techniques to analyse 
a WFPC2 image of the cluster C10939+4713, which yields a 
detection of the lensing effects induced by elliptical cluster 
galaxies. Section JE] concludes the paper with a discussion of 
the results. 



2 METHOD 

We continue to use the concepts and definitions introduced 
in Paper I. In general the symbols for the surface mass den- 
sity, the shear, and the lensing potential should be under- 
stood as referring to (hypothetical) sources located at infi- 
nite redshift. For simplicity we omit the oo-subscript for the 
relevant quantities in this paper if additional subscripts are 
present. 



2.1 Description of the cluster component 

The lensing properties of the cluster component are de- 
scribed by the dimensionless scalar potential ^/>oo. The sur- 
face mass density kc and the shear 7c = 71c + i 72c are 
combinations of second derivatives of this potential: 

K C = ^(^,11 + ^,22), 7lC = ^(V 1 ,!! - ^,22), 720 =^,12- 

In practice, we specify the potential ip a g on a (n + 2) x 
(n + 2) grid (a, /3 = 0, . . . , n + 1), and calculate the cluster 
contribution to the surface mass density kc a fs and the shear 
7Cq/3 = liCa/3 + i72C Q/ 3 on a n x n grid (a, j3 = 1, . . . ,n), 
using discrete versions of the second derivatives: 

KCaf3 — -{lpa+1 + "4>a-10 + Ipa 0+1 + i>a 0-1 ~ 4^) , 
JlCa/3 = ^(ipa+lp + Ipa-lff — 1pa0+l — 1pa 0-l) , and 
72Ca,3 = —(l/ja+l 0+1 + lpa-1 0-1 ~ ifta+l 0-1 ~ Ipa-l 0+l) ■ 

The index values a, f3 = 1 and a, j3 = n correspond to oppo- 
site corners of a square field of view. The lensing parameters 
Kic and 7ic at the positions of individual background galaxy 
images i are obtained by bilinear interpolation of their values 
on the adjacent grid points. They can therefore be expressed 
as linear combinations of the potential components: 



For each background galaxy image the coefficients a i7< s and 
bi-ys are non-zero only for 12, respectively 16, neighbouring 
grid points. 

The total surface mass density re^ and shear 7^ are cal- 
culated by adding the contributions KiGj and 7iGj from the 
cluster galaxies Gj to those of the global cluster component: 



and 



7; 



(2) 



3=1 



3=1 



The values KiGj and jiGj depend on the relative positions of 
background images and cluster galaxies and, of course, on 
the mass distribution of the cluster galaxies. As in Paper I 
we use a truncated singular isothermal sphere as a model for 
their dark matter distribution and apply the scaling laws 



-(e) 



and 



(£)" 



(3) 



with r\ = 4 and v = 0.5 to relate the model parameters to 
those of an L*-galaxy with velocity dispersion cr* and cut-off 
radius s*. This choice of parametrization was critically dis- 
cussed in Paper I and in practice different strategies could 
be more appropriate. However, the principles of the method 
described here do not depend on the details of the galaxy 
parametrization scheme, and for the rest of this section a* 
and s* can be more generally interpreted as representing a 
given set of parameters used to describe the mass distribu- 
tion of the cluster galaxies. 

The likelihood function is defined as the product of the 
probability densities for the observed image ellipticities of 
the background galaxies: 



£(0*, S*, {i>cc0}) = \\ Pc{ti I Ki, 7i) 



(4) 



It depends on the galaxy parameters cr* and s+ and on the 
description of the global cluster component provided by the 
potential values ip a p on the grid points. 

2.2 Probability density distributions 

In Paper I we calculated and illustrated the probability den- 
sity distribution p c (e | k™, 700) for the image ellipticities of 
gravitationally lensed background galaxies. For the present 
method it turned out to be necessary to approximate p c by 
a simple analytical function for numerical reasons. 

In the case of a single redshift plane for the background 
galaxies, or if an estimate for the redshift of individual galax- 
ies is available (e.g., photometric redshifts), the expectation 
value for image ellipticities is determined by the reduced 
shear: 



9 



9 

1/9* 



for 
for 



< 1 
> 1 



(5) 



Contours of the probability density distribution in the el- 
lipticity parameter space e are fairly circular (see Fig. 5 of 
Paper I) and we show in Appendix W that the dispersion of 
the distribution in 'shear direction' is equal to the dispersion 
perpendicular to this direction in the ellipticity coordinates. 
Therefore, the Gaussian distribution 



Kic = 2_, ai ~f 5 and T> c = b *-/ s & 



*y5 



(i) 



7,5=0 



7,5=0 



7TCT?(P) 



(6) 
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represents a convenient approximation to the probability 
density distribution which reproduces its first and second 
moments. The dispersion cr e (g) as a function of g can be 
approximated by a simple analytical fit to numerical results 
similar to the discussion further below. 

In the case of a redshift distribution for the source 
galaxies it is more difficult to specify simple expressions to 
approximate the probability density distributions. In this 
paper we restrict the application of our method to images 
located in non-critical regions of clusters. In this case we can 
use the approximation (Seitz & Schneider 1997) 



<*>« 



for the expectation value of the image ellipticities. In the 
presence of a source redshift distribution the probability dis- 
tribution for the image ellipticities is elongated in the 'shear 
direction' (see Fig. 5 of Paper I), i.e., the dispersion in this 
direction is larger than in the perpendicular direction. Hence 
we model the probability distribution with a Gaussian of the 
form 



p £ (e | too, 7°°) 



1 



2tt (TeAgr) o-i 2 (g T ) 
_ 1 (£i-flr) 2 

• e 



~-> 

C-2 



(T) 



where g r is defined as 

M* I Too 



9t 



and e = ei + i e 2 



l7°o| 



expresses the image ellipticities in 'local shear coordinates', 
i.e., in a coordinate system in which the imaginary part of 
the (reduced) shear vanishes. We approximate the depen- 
dence of the dispersions ai 1 and ai 2 on g T by quadratic 
functions: 

(<?r) « —=. cr Es + cn g t + ci 2 <?, , and 
v2 

a H (ffr) ~ -y= 0"6 S + C21 ffr + C22 <?r • 

For a given intrinsic ellipticity and redshift distribution of 
the sources the constants are determined by a fit to numer- 
ical results as shown in Fig. [jj 



2.3 Likelihood maximization 

The logarithm of the likelihood function is given by 

J(o*,a*, {ip a p}) := In £ = 7^ In p E (ei | k», 7*) . (8) 



For a given set of cluster galaxy parameters (cr*, s*) the like- 
lihood is maximized with respect to the potential values tp a /3 
on the grid points in order to determine the best representa- 
tion of the global cluster component that is consistent with 
the presence of the cluster galaxies as specified by the model. 
Repeating this maximization procedure for varying galaxy 
parameters allows to determine the likelihood l(a+,s+) as a 
function of the galaxy model and to derive confidence re- 
gions in the parameter space. 




0.0 0.2 0.4 0.6 0.8 0.0 0.2 0.4 0.6 0.1 



Figure 1. Approximations for the dispersion of the probability 
density p e (e). The dispersions erg, and ctj 2 were calculated for 
a grid of parameter values of the quantities and |7oo|- The 
crosses in the plots show the results as a function of g r for all 
data points with |<7oo| < 1. The solid lines show fits to these data 
points using quadratic functions. The results displayed in this 
figure were calculated for the intrinsic ellipticity (cr £s = 0.2) and 
redshift distributions that were used in the simulations of Paper I. 



Specifying the parameters for the galaxy mass model 
determines the mass Mg contained in galaxies in absolute 
numbers: 



M G (cr* 



N 







(9) 



Here Lj denotes the cluster galaxy luminosity, but it can be 
more generally interpreted as representing a set of galaxy 
characteristics that are used to specify the mass model for 
individual galaxies. On the other hand, the total mass M of 
the galaxy cluster, or equivalently the average surface mass 
density /too, is provided by a mass reconstruction. The like- 
lihood function defined above only uses information on the 
shapes of background images and therefore it is invariant 
under the mass sheet transformation (see Paper I). How- 
ever, we assume that this degeneracy can be broken by other 
means and so the total cluster mass within the field of view 
is fixed. In order to include the information on the total 
mass into the maximization procedure we adopt the follow- 
ing strategy: The fraction of mass contained in the global 
cluster component and not associated with galaxies is de- 
noted as /c(c*,s*) := 1 — Mg(o*, s*)/M and depends on 
the galaxy model parameters. Multiplication of the likeli- 
hood function with the factor 

\2 



1 («C - fc K,oo) 



P« C («C) = 



2tt 



(10) 



ensures that the average surface mass density 

1 n 
n 2 



of the reconstructed global cluster component reproduces 
the correct mass fraction /c(0*,s*). This corresponds to 
adding the term 



(«C - fc KooY 

2aL 



+ constant 



(11) 



to the logarithm of the likelihood. The value of the param- 
eter <jr c is only of numerical interest and has been chosen 
as 0.001 for the applications described later in this paper. 
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The lensing quantities, surface mass density and shear, 
are second derivatives of the potential ipaa, and therefore 
adding linear functions to the potential does not change the 
mass distribution and the likelihood. It turned out that the 
presence of these invariance transformations does not cause 
practical problems in our implementation of the method. In 
principle they can be suppressed during the maximization 
procedure, for example by fixing the potential values at three 
corners of the field of view as in Bartelmann et al. (1996). 

In order to prevent the cluster component from fitting 
noise and from exhibiting structures on small scales we also 
add an entropy-like regularization term of the form 

n 

S= ln/c Ca/3 (12) 

a,0=l 

(see e.g. Narayan & Nityananda 1986) with 

n 

kCa/3 = HCa/3 / KC~,8 and kCa/3 = K-Ca/3 /kPojS 

7,6=1 

to the logarithm of the likelihood function. An additional 
benefit of this regularization term is that it ensures the re- 
constructed surface mass density to be positive. The factor 
Kp a allows to include prior information on the mass distri- 
bution into the regularization. 

A conjugate gradient algorithm from Press et al. (1992) 
is then used in order to maximize the quantity 

i>*.»*.{W)-' + i« + A5. (13) 

This algorithm makes use of the derivatives dl/dip a p which 
are straightforward to compute analytically (see Appendix 
|b|). The choice of the regularization parameter A allows to 
adjust the result between a pure likelihood maximization 
and a reproduction of the prior mass distribution. In gen- 
eral we use the mass distribution from a conventional recon- 
struction as the prior. In addition, we calculate the potential 
%l>-p a corresponding to this conventional reconstruction and 
use /c(cr*, s^) ipp a p as the start value for the maximization 
algorithm. This guarantees the correct mass fraction for the 
start value of the cluster component as a function of the 
galaxy model parameters and therefore reduces the compu- 
tational effort for the maximization procedure. 



3 APPLICATION TO SIMULATIONS 

In Paper I we divided the data of our simulations into two 
subsets according to the position of the background galax- 
ies within the field of view: A central sub-field containing 
the highly non-linear lensing regime, and the outskirts of 
the cluster where the surface mass density is comparatively 
low. For computational reasons we restrict the application 
of the generalized likelihood method presented in this paper 
to the central data region that covers 2.'5 x 2'.5 and includes 
roughly 90 cluster galaxies and 240 background galaxies. We 
use a conventional inversion method as explained in Paper I 
to reconstruct the mass distribution within the total field 
of view (10' x 10') of the simulations. The lensing potential 
ipoa can be determined by integrating the surface mass den- 
sity appropriately. We then apply the likelihood procedure 
described in the previous section taking the surface mass 



density and the potential in the central sub-field as regular- 
ization prior and start value, respectively. The reconstructed 
mass distribution also specifies the average k^, = 0.32 of the 
total surface mass density within the sub-field, which is re- 
quired for the term l Rc ensuring the correct total mass. 

We explore the dependence of I on the galaxy param- 
eters, velocity dispersion a* and cut-off radius s t , by max- 
imizing the quantity Z(er*,s*) with respect to the potential 
values on the grid points. Fig. || shows confidence regions 
resulting from these calculations for the same realizations 
of cluster and background galaxies as in Paper I. The num- 
ber of grid points used here is 5 x 5 which corresponds to a 
grid point separation of 68ft" 1 kpc. This provides sufficient 
resolution for an adequate description of the global cluster 
mass distribution that had been used as an input for the 
simulations. The value of the regularization parameter was 
chosen as A = 1. 

A comparison of the results for the input model with 
small galaxy haloes (s* = 3.4/i _1 kpc) to those of Paper I 
shows that the confidence regions for the galaxy parameters 
within the plotted range are very similar, but more extended 
with the new method. The simpler approach of Paper I pro- 
vides satisfying results when testing model parameters that 
imply a small fraction of the total mass in galaxies. For 
increasing galaxy mass fraction, however, the difference be- 
tween the (appropriately scaled) conventional mass recon- 
struction and the concept of a global underlying cluster mass 
component becomes larger. In the new method the cluster 
component can adapt to the presence of potentially massive 
galaxies and redistribute the matter in the cluster compo- 
nent accordingly. The additional degrees of freedom lead to 
the larger extension of the confidence regions for increasing 
s*-values. (As in Paper I the range of the cut-off radius pa- 
rameter was confined to small values in the plots in order 
to emphasize the structure of the confidence regions close to 
the input value, and because the inclusion of prior informa- 
tion on the velocity dispersion as discussed in Paper I could 
rule out large values for the cut-off radius in this case.) 

In the model with s* = 34/i _1 kpc the galaxies con- 
tain a considerable fraction of the total mass of the sys- 
tem. In Paper I we investigated strategies for subtracting the 
galaxy masses from a conventional cluster reconstruction in 
order to use it as a description of the 'global cluster com- 
ponent' in the likelihood method described there. For the 
highly non-linear lensing regime in the cluster centre, how- 
ever, this proved to be very difficult and the resulting con- 
fidence regions for the galaxy parameters turned out to be 
rather sensitive to the details of the subtraction procedure. 
These problems are nicely resolved by applying the general- 
ized likelihood method presented in this paper, which takes 
the presence of the galaxies explicitly into account when de- 
termining the best representation of the global cluster com- 
ponent for a given galaxy model. As discussed in Paper I 
the quantity that can be determined best with this lensing 
method is the mass within the projected radius that corre- 
sponds to about the closest separation between cluster and 
background galaxies used in the analysis. As expected the 
confidence regions resulting with the new method are ex- 
tended along lines of equal mass within this radius. Due to 
the freedom of the cluster component to adapt to the galaxy 
model, the extension of the confidence regions is larger than 
in the respective plots of Paper I. 
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Figure 2. Confidence regions for the velocity dispersion and the cut-off radius calculated with the method described in this paper using 
a 5 X 5 grid and the regularization parameter A = 1. The four different realizations of simulated observational data are the same as in 
Paper I. This figure should be compared with the bottom sets of plots in Fig. 7 of Paper I. The plots on the left are for the galaxy input 
model with a cut-off radius of s* = 3.4/i _1 kpc, and the plots on the right are for s* = 34/i — 1 kpc. The confidence contours are 99.73%, 
95.4%, and 68.3%, determined in the way explained in Paper I (without prior information). The triangle denotes the input values and 
the cross marks the maximum of the likelihood function. The dotted line connects models with equal total mass, and along the dashed 
line the mass within a projected radius of 5Ah~ 1 kpc is constant. 




Figure 3. Dependence of the confidence contours on the regularization parameter A. The random realization of cluster and background 
galaxies is the same as for the bottom left plots in the panels of Fig. ^. The values for A are 0.01 (top left), 0.1 (top right), 10 (bottom 
left) and 100 (bottom right). The number of grid points is 5 X 5. The significance of the contours and the meaning of the symbols is 
analogous to Fig. 0. 



In Fig. g we investigated the sensitivity of the results 
to the choice of the regularization parameter. For large val- 
ues of A the 'prejudice' that is provided by the conventional 
reconstruction and that enters the method as the regulariza- 
tion prior, is highly weighted and dominates over the likeli- 
hood term in equation (h3|) . Regularizing too strongly there- 



fore leads to the same kind of problems as discussed for the 
method of Paper I, and the confidence regions approach the 
solution of Paper I for A — * oo. This trend is more obvious 
for the input model with massive galaxies (s* = 34/i _1 kpc), 
where the difference between the results of the two methods 
is larger. Reducing A enables the cluster component to adapt 
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Figure 4. Dependence of the confidence contours on the number 
of grid points. The random realization of cluster and background 
galaxies is the same as for the bottom left plots of Fig. ^ The top 
plots are for an 8 X 8 grid, and in the bottom plots a 12 X 12 grid 
was used. The regularization parameter is A = 1. The significance 
of the contours and the meaning of the symbols is analogous to 
the previous figures. 



to changes in the prescription for the galaxy model more lib- 
erally by maximizing the likelihood term I, and hence leads 
to a larger extension of the confidence regions for the galaxy 
parameters. We discuss the question for the optimal choice 
of the regularization parameter below in context with the 
evolution of the absolute likelihood value. 

Fig. ^ displays confidence contours calculated with a 
finer grid for describing the lensing potential of the cluster 
component. An 8x8 grid corresponds to a grid point sepa- 
ration of 39/i _1 kpc, and a 12 x 12 grid gives 25h~ 1 kpc. In 
the case of the galaxy model with small cut-off radius the 
result shown here is only slightly affected by changing the 
grid, because the parameter range for the cut-off radius cov- 
ered by the plots still remains smaller than the grid point 
separation. In the model with extended (s* = 34ft -1 kpc) 
dark matter haloes for the cluster galaxies, however, a re- 
fined grid resolution already enables the cluster component 
to adapt within a certain extent to the large haloes of lu- 
minous cluster galaxies. Models with a larger fraction of 
the mass in the cluster component and smaller galaxy mass 
fraction have more freedom to adjust to noise. This leads to 
a shift of the maximum likelihood solution towards lower 
values for the s*-parameter and biases against high cut- 
off radius values for which a large mass fraction is fixed in 
galaxies and not available to fit noise. Solving this problem 
by applying a stronger regularization to prevent the cluster 
component from fitting structures on 'galaxy scales' is not 
straightforward because this increases the weight given to 
the conventional reconstruction and leads to the same prob- 
lems as mentioned above. In order to reliably retrieve the 
input values of the simulations even with small grid point 
separations, it could be useful to explore strategies in which 
the prior is adapted during the maximization process. How- 



Figure 5. Dependence of the maximum likelihood value on the 
regularization parameter and on the number of grid points. The 
histograms show the probability distribution of the maximum 
Max(i) of the logarithm of the likelihood function calculated from 
the correct input mass distribution and many different galaxy re- 
alizations. The plots on the left are for the galaxy input model 
with a cut-off radius of = 3Ah~ 1 kpc, and the plots on the 
right are for = 34/i -1 kpc. Dashed vertical lines indicate the 
value of I at the maximum of the likelihood function, and dotted 
vertical lines denote its value for a reconstruction without cluster 
galaxies. The bottom plots depict the dependence on the number 
of grid points for A = 1. With increasing I the lines represent a 
5 X 5, 8 X 8, 10 X 10, and 12 X 12 grid. The top diagrams show the 
dependence on the regularization parameter for a 5 X 5 grid. With 
increasing I the lines are for A = 100, 10, 1, 0.1, and 0.01. How- 
ever, the latter likelihood values are close together, and so the last 
two or three lines cannot be distinguished in this representation. 
[The values for Max(Zlindicated by the vertical lines were calcu- 
lated using equation (hp for the probability distribution of image 
cllipticities, whereas the histograms were determined without this 
approximation. The differences between approximation and exact 
calculation are small and do not change any of the conclusions 
drawn in the text.] 



ever, it is a much more pragmatic point of view to accept 
these difficulties as a manifestation of the fundamental con- 
ceptual problem of making a clear-cut distinction between 
the galaxy mass distribution and a global cluster compo- 
nent. In this philosophy the grid point distance provides the 
scale for a technical separation of the galaxies from a more 
general mass component. 

Finally Fig. ^ depicts the evolution of the maximum 
Max(Z) of the logarithm of the likelihood function when 
changing the regularization parameter and the number of 
grid points. In the case of the 5x5 grid the rather large sepa- 
ration between the grid points already provides some kind of 
regularization to prevent the cluster component from fitting 
noise. Decreasing A increases the maximum of the likelihood 
function only slightly, and Max(Z) stays within the probabil- 
ity distribution p(l) for the logarithm of the likelihood that 
indicates statistical consistency with a correct description 
of the mass distribution. Although the mass reconstructions 
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within the range of regularization values tested here (A = 
0.01 - 100) are all statistically consistent with the observ- 
ables in an absolute likelihood sense, we consider A = 1 as 
the optimal value providing the most reliable results for the 
analysis of our simulations. Regularizing too strongly causes 
the problems that were discussed above. Very small A val- 
ues lead to an extension of the likelihood contours in the 
direction of very high values for the cut-off radius parame- 
ter (see Fig. |^) . Visual inspection of the reconstructed clus- 
ter component for these large cut-off radius solutions reveals 
strongly fluctuating mass distributions which obviously fit 
noise. 

Increasing the number of grid points allows the cluster 
component to adjust to smaller structures, but it also in- 
creases the tendency to fit noise. The dependence of Max(Z) 
on the grid size (for fixed A) is much stronger than its de- 
pendence on the regularization parameter. For a 12 x 12 grid 
with A = 1 the value of Max(Z) is already far in the tail of its 
expected distribution, which suggests that small scale struc- 
tures in the mass distribution of the reconstructed cluster 
component mainly consist of noise. To compensate this effect 
one would need to increase the regularization strength. The 
figure also shows that in the case of extended dark matter 
haloes for the galaxies, the difference in the likelihood values 
between the best reconstruction with galaxies and a recon- 
struction without galaxies is much larger than for the input 
model with small cut-off radius for the cluster galaxies. 



4 APPLICATION TO CL0939+4713 
4.1 Cluster reconstruction 

We test the method described above on a WFPC2 image 
(Dressier et al. 1994b) of the cluster C10939+4713. From 
these observations Seitz et al. (1996, hereafter SKSS) ob- 
tained a reconstruction of the cluster mass distribution by 
applying a non-linear finite-field inversion algorithm. We use 
their galaxy catalogue which supplies positions, ellipticities, 
and R magnitudes. The cluster is at a relatively high red- 
shift of Zd — 0.41. This means that the normalization of 
the reconstructed mass distribution is rather sensitive to 
the unknown redshift distribution of the faint galaxies that 
are used as sources. In addition, due to the small field of 
view which contains the central cluster region only, it is not 
possible to break the mass sheet transformation by assum- 
ing negligible surface mass density values at the boundaries. 
The lack of colour information for the objects detected in the 
field also makes it difficult to break this degeneracy by using 
magnification effects on the number counts or on the sizes 
of the background galaxies. SKSS investigated the implica- 
tions of these problems on their cluster reconstruction. Here 
we restrict the calculations to assumptions for the relevant 
parameters and qualitatively discuss the effects of changing 
them afterwards. The cluster is marginally critical and con- 
tains an arc as well as a multiple image system of a source 
galaxy located at a redshift of 3.98 (Trager et al. 1997). The 
extension of the critical region is very small compared to the 
total area of the field, and so we exclude it from our analysis. 

We consider all galaxies in the magnitude interval R G 
[23.5, 25.5] that fulfill the observational selection criteria of 
SKSS as sources. However, we exclude three objects whose 



images are located between the central cluster galaxies and 
are therefore likely to lie in the critical region. In total we use 
276 galaxy images. Fig. ^ shows the spatial distribution of 
this population and the ellipticity field that was calculated 
from their shapes by employing an averaging procedure with 
a scale of 20". We assume a source redshift distribution of 
the same form as it was used for our simulations, i.e., equa- 
tion (10) of Paper I with the parameters (z) — 1 and (3 = 1. 
This gives a value of (w) = 0.44 for the average relative lens- 
ing strength (we use Q — 1 and A = 0). In analogy to SKSS 
we reconstruct the cluster mass from the ellipticity field by 
applying the methods developed in Seitz & Schneider (1996) 
and Seitz & Schneider (1997). Our result is consistent with 
theirs. We scale the reconstructed mass distribution such 
that the average surface mass density is Koo = 0.35 by us- 
ing the mass sheet transformation in the form generalized 
to a redshift distribution of the sources (Seitz & Schneider 
1997). This gives a total mass of 1.97 x 10 14 /t _1 Mq within 
the field of view. 

We calculate the lensing potential from the surface mass 
density. This involves an integration that formally extends 
over the whole lens plane, whereas the data region is limited. 
The shear contribution provided by the matter outside of the 
field of view cannot be neglected here, because the surface 
mass density values are still appreciable at the boundary. 
We extrapolate the reconstructed mass distribution with a 
simple linear prescription outside of the data region before 
calculating the potential %j) a p on a grid. Therefore, the re- 
sult correctly describes the mass distribution in the field 
of view as it was determined from the reconstruction, but 
it contains only an approximate description of the shear. 
Nevertheless, it serves as a reasonable start value for the 
maximum likelihood algorithm. The arbitrary element in- 
troduced by the extrapolation procedure is then of course 
resolved during the likelihood maximization which adjusts 
the potential such that the shear is optimally reproduced. 
We use the surface mass density values on the grid points cal- 
culated from the initial potential grid as the regularization 
prior Kp a p. The missing quadrant of the wide field camera 
does not cause a problem for the technical implementation 
of the method. The potential values on the irrelevant grid 
points are excluded from the maximization procedure. The 
calculation of the entropy S and the average surface mass 
density Rc is restricted to the data region. For the probabil- 
ity distribution of image ellipticities the approximation (W) 
is applied with dispersions calculated in analogy to Fig. Ill 
taking into account the different cluster redshift and using 
the same intrinsic ellipticity dispersion of a 6s = 0.2 as for 
the simulations. 

Fig. |6] shows maximum likelihood reconstructions of the 
cluster C10939+4713 that were performed with our algo- 
rithm without taking the presence of the cluster galaxies 
into account. Using a 6 x 6 grid for the surface mass den- 
sity gives a grid point separation of 101/i -1 kpc and allows 
to resolve the main features characterizing the mass dis- 
tribution. As discussed by SKSS the two mass maxima in 
the bottom quadrants and the minimum in the top right 
quadrant tend to correlate with the distribution of bright 
galaxies. This trend is also visible in the diagram depict- 
ing the distribution of our cluster galaxy sample which will 
be defined below. The galaxies in the top right quadrant 
are mainly less bright spirals and a substantial fraction of 
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Figure 6. The galaxy and mass distribution in the WFPC2 field of the cluster C10939+4713. The top left plot displays the galaxies 
in the magnitude interval R £ [23.5, 25.5] which were used as sources for the weak lensing analysis. Each galaxy is represented by an 
ellipse which indicates its ellipticity, position angle and relative size. Superposed on these individual galaxy images is their average 
ellipticity field on a 20 X 20 grid. The length of the bar in the top left corner of this plot corresponds to an average ellipticity of |e| =0.1. 
The top right plot shows all the galaxies that were included as cluster galaxies in our analysis. The ellipses drawn with solid lines 
represent objects which were classified as E- or SO-galaxies in Smail et al. (1997), and the broken lines indicate those classified as spiral 
or irregular galaxies. The bottom plots depict reconstructions of the cluster mass distribution using our maximum likelihood method 
without including the presence of cluster galaxies. The left plot was produced with a 6 X 6 grid, and the plot on the right was obtained 
with a 12 X 12 grid. For the purpose of this representation the reconstructions have been bicubically interpolated on a finer grid before 
using the contour plot program. The regularization parameter was A = 10. The contour lines depict the surface mass density values 
Koo = 0.1, 0.2, 0.3, . . . , 1.0. The mass sheet degeneracy was artificially adjusted such that the average surface mass density in the data 
region is Soo = 0.35. The side length of the field of view is 2.'58 which corresponds to 505/i _1 kpc at the cluster redshift of = 0.41. 
(We assume Q = 1 and A = in this paper.) 



them might not be cluster members. A 12 x 12 grid with 
a grid point separation of 46/i _1 kpc enables the mass dis- 
tribution to adjust to smaller structures. The weak lensing 
reconstruction now produces a high mass peak roughly at 
the position of the strong lensing features. The height of 
this maximum depends on the regularization strength. Its 
exact position depends on the number of grid points, be- 
cause in the method maxima can obviously only occur at 



grid points. The positional uncertainty of the mass peak is 
such that its location is consistent with coinciding with the 
strong lensing region. SKSS were not able to provide evi- 
dence for such a rather steep increase of the surface mass 
density at this position, because inversion algorithms which 
rely on the averaged ellipticity field as input tend to smear 
out such features, whereas the maximum likelihood method 
applied here uses the individual image ellipticities directly 
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as observables (see also Seitz et al. 1998). However, the re- 
liability of the smaller scale structures in this mass map is 
doubtful and their significance is difficult to estimate. We 
will come back to this question at the end of the next sec- 
tion in context with a discussion of the absolute likelihood 
value. 

The strong lensing features are caused by the joint ef- 
fects of a global cluster component and several bright cluster 
galaxies located in the cluster centre. We do not attempt 
to reproduce the multiple image system here, because the 
spatial resolution reliably accessible to the non-parametric 
reconstruction is considerably larger than the image sepa- 
ration. This also means that it is very difficult to use the 
strong lensing information for breaking the mass sheet de- 
generacy and to determine the slope of the central cluster 
profile in this case. 
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4.2 Cluster galaxy analysis 

To select cluster galaxies we used the morphological clas- 
sification and instrumental magnitudes (-R702) provided by 
Smail et al. (1997). Galaxies that were denoted as 'com- 
pact' or 'unclassifiable', or for which measured redshifts from 
Dressier & Gunn (1992) exclude a cluster membership, were 
not taken into account. We define an L*-galaxy by requiring 
an absolute V-magnitude of —20.5+5 log h (corresponding to 
mv = 20.2), which is close to the value determined by Smail 
et al. (1997) from a fit to the luminosity function of elliptical 
galaxies in clusters of this redshift range. For direct compari- 
son we take the same value for spiral galaxies as well. For the 
cluster redshift of Zd = 0.41 the R-band filter probes roughly 
the same spectral range as a rest frame V-band filter, and 
the correction factors for the conversion of these niters are 
small (B. Ziegler, private communication). For our cluster 
galaxy sample we therefore identify R-magnitudes from the 
catalogue with rest-frame V-magnitudes. Possible system- 
atic errors introduced by this simplification are in any case 
smaller than those resulting from other problems that will 
be discussed later. As in our simulations we only include 
galaxies with a luminosity brighter than 0.1L* (my < 22.7) 
in our analysis, because very faint cluster galaxies do not 
contribute much to the lensing signal. According to the mor- 
phological classification we divide our sample into two sub- 
sets. One of them includes 56 elliptical and SO-galaxies and 
the other contains 55 objects classified as spiral or irregu- 
lar galaxies. Fig. ^ schematically displays the galaxies that 
were included in our cluster galaxy sample. It is very likely 
that some of the objects in the sample are in fact fore- or 
background galaxies and the implications of this problem on 
the results of the lensing analysis will be discussed below. 
Fig. 4 of Dressier et al. (1994a) and Fig. 6 of Belloni et al. 
(1995) suggest that especially in the spiral galaxy subset a 
significant fraction could be interloping field galaxies. 

Fig. displays the results of applying the galaxy-galaxy 
lensing method developed in Section ^. Here we chose a 6 x 6 
grid for the cluster mass distribution which provides suffi- 
cient resolution for an adequate description of the reliably 
reconstructed features. As a start value for the potential ip a /3 
we took the maximum likelihood solution without galaxies 
that has been discussed in the previous section. In addition, 
the corresponding cluster mass distribution was taken as 
the regularization prior. The regularization parameter was 
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Figure 7. Results of applying the galaxy-galaxy lensing method 
presented in this paper to a WFPC2 image of the cluster 
C10939+4713. The top plot shows confidence contours for the 
parameters describing the halo properties of an ensemble of 56 
elliptical cluster galaxies, and the bottom plot is for 55 spiral 
galaxies. Note the change of scale at s* = 10h~ 1 kpc. The calcu- 
lations were performed with a 6 X 6 grid for the cluster mass dis- 
tribution, a regularization parameter of A = 100, and the values 
r\ = 4 and v = 0.5 for the parameters of the scaling laws relating 
the properties of galaxies with different luminosities to those of an 
L*-galaxies, which we defined as having an absolute V-magnitudc 
of —20.5 + 5 log h. The likelihood function includes shape informa- 
tion from 276 background galaxy images. The confidence contours 
are 99.73%, 95.4%, and 68.3%, determined in the way explained 
in Paper I, and the cross marks the maximum of the likelihood 
function. The dashed lines connect models with an equal aper- 
ture mass of M*(< 8/i _1 kpc) = 0.1,0.5, and 1.0 X 10 11 h~ 1 M ( 7 ) , 
respectively. Similarly, the dotted lines are for a total L*-galaxy 
mass of M* = 0.1,0.5, 1.0,5.0 and 10 X lO 11 /! -1 M , which corre- 
sponds to a 'galaxy mass fraction' of 0.15%, 0.75%, 1.5%, 7.5% and 
15%, respectively. As discussed in the text, the results displayed 
here depend on assumptions regarding the redshift distribution of 
the sources and the mass sheet transformation! Changing these 
assumptions mainly leads to a shift of the confidence contours in 
a* -direction. 



chosen as A = 100. In analogy to our simulations we use 
the scaling laws ^ to relate the velocity dispersion and the 
cut-off radius of galaxies with different luminosity to those 
of an L*-galaxy, and we fix the scaling indices at the val- 
ues 77 = 4 and v = 0.5. We performed the calculation for 
the ensemble of 56 elliptical galaxies ignoring the presence 
of the spiral galaxies at first. For given values of the galaxy 
model parameters the method searches for the best repre- 
sentation of the remaining cluster mass component leaving 
the total mass of the system constant. Reassuringly, the re- 
sulting likelihood contours for the galaxy model parameters 
a* and s* show qualitatively the same properties as in our 
simulations. Despite the small number of lens and source 
galaxies, the lensing effects of the elliptical cluster galaxies 
are detected with high significance: Reducing the mass of the 
cluster component and putting it into galaxies increases the 
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likelihood of the observed image ellipticities of background 
galaxies. As expected, the likelihood contours are extended 
along lines connecting models that imply equal mass for an 
L+-galaxy within some characteristic radius. We empirically 
determined a value of 8/i _1 kpc by adjusting the dashed lines 
in the figure to the shape of the confidence contours in the 
low-s* region of the plot. Therefore the mass within this 
radius is the best-determined quantity from this analysis. 

The figure also includes the result of an equivalent anal- 
ysis of the spiral galaxy sample. In contrast to the ellipticals 
we do not find a signal in this case, although the number 
of galaxies and the range of luminosities are comparable for 
the two sets of galaxies. However, there is a factor of %/2 
difference between the characteristic velocity dispersion pa- 
rameter for elliptical and spiral galaxies of the same lumi- 
nosity. This translates into a factor of two difference in the 
expected strength of the lensing effects, and so it is not sur- 
prising that the result is weaker for the spiral galaxy sample. 
As was mentioned above some of the spirals in this sample 
might actually be background galaxies, whereas we are inter- 
ested in the properties of cluster galaxies here. Of course this 
could lead to a bias of the result, simply because they are 
the wrong kind of objects included in the sample, but also 
because their size and luminosity, as well as the strength of 
their lensing effects on the population of faint source galax- 
ies would then be different than tested by the model, which 
assumes that they are located at the cluster redshift. In con- 
trast to that, not taking into account cluster galaxies, which 
actually are present, does not introduce systematic effects: 
Repeating the above calculation of the confidence contours 
for spirals and ellipticals with inclusion of the best-fit model 
for the other galaxy type only leads to negligible changes of 
the result. The fact that non-cluster galaxies that were mis- 
classified as cluster members can cause systematic effects 
whereas leaving out genuine cluster members does not, sug- 
gests that it is useful to apply conservative selection criteria 
for defining the cluster galaxy sample. 

We checked the sensitivity of the result to changes of 
the regularization strength or the number of grid points that 
are used to describe the cluster potential. Fig. ^ displays 
the confidence contours for a reduced regularization param- 
eter of A = 10 (keeping a 6 x 6 grid), as well as for an 
8x8 grid for the cluster component (keeping A = 100). 
These calculations necessitate a larger amount of computa- 
tion time than those performed for Fig. 0. The appearance 
of the confidence regions does not change very much with 
respect to our previous result. Now the axes of the plots 
are outside of the 99.73% confidence contour in both cases. 
(The axes correspond to model parameters implying zero 
mass in galaxies and therefore represent a pure cluster mass 
reconstruction without cluster galaxies.) This means that 
the galaxy-galaxy lensing signal becomes slightly more sig- 
nificant when the ability of the cluster component to adjust 
to the presence of the cluster galaxies is increased either by 
improving the resolution or by reducing the regularization 
strength. The confidence contours in Fig. |§| (but not those 
in Fig. [?]) have been slightly smoothed in order to erase 
some discontinuities caused by a premature stopping of the 
maximization algorithm for some parameter values. This be- 
comes a more serious problem for a larger number of grid 
points or a smaller regularization parameter. For the refined 
resolution achieved with the 8x8 grid, which corresponds 
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Figure 8. Dependence of the result for elliptical cluster galaxies 
on the number of grid points and the regularization strength. The 
top plot is for A = 10 with a 6 X 6 grid, and the bottom one for 
A = 100 with an 8 X 8 grid. The significance of the contours and 
the meaning of the lines is analogous to the previous figure. 

to a grid point separation of 72h~ kpc, the confidence con- 
tours are less extended along the cut-off radius coordinate. 
As we discussed in Section this effect can be explained 
by the ability of the cluster component to adapt to mass 
structures on scales comparable to the largest cut-off radius 
values shown in the plot. 

To perform our analysis we were obliged to make as- 
sumptions on the redshift distribution of the faint objects in 
our background galaxy sample. Underestimating the source 
redshifts leads to an overestimate of the lens masses and 
therefore to a shift of the cluster galaxy confidence contours 
towards higher values for the velocity dispersion parameter. 
Conversely, overestimating the source redshifts leads to an 
underestimate of the galaxy masses. However, in our ap- 
plication the mass sheet degeneracy constitutes an addi- 
tional complication. We broke this degeneracy artificially 
by fixing the average surface mass density at the value 
Koo = 0.35. For the redshift distribution that we have as- 
sumed this is roughly the minimal value that ensures that 
the mass density is non-negative over the whole field of 
view. Changing the assumptions for the redshift distribu- 
tion leads to a change of the minimal possible value for the 
source redshift- independent quantity Koo- Equally viable so- 
lutions with larger provide more convergence and require 
smaller variations in the mass distribution - and thus smaller 
galaxy masses - to produce the same observable image dis- 
tortions. Adjusting /too therefore allows to scale the resulting 
confidence contours for the cluster galaxy model parameters 
in the direction of the cr*-axis of the plot. In future appli- 
cations with larger fields of view, dealing with this techni- 
cal problem will hopefully become obsolete. The problems 
mentioned here make it impossible to obtain tighter limits 
on the cut-off parameter s* by including prior knowledge on 
the velocity dispersion parameter <j* as it was envisaged in 
Paper I. 
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Figure 9. Constraints on the scaling parameter r) for ellipti- 
cal cluster galaxies. The plot shows 99.73%, 95.4%, and 68.3% 
confidence contours for the velocity dispersion ct* and the scal- 
ing index rj. Here we assumed a characteristic cut-off radius of 
= 8h _1 kpc, corresponding to the maximum of the likelihood 
function in Fig. Q and a cut-off radius scaling index of v = 0.5. 
The number of grid points is 6 X 6 and the regularization param- 
eter is A = 100. 

The procedure that had been used by SKSS to deter- 
mine image ellipticities for the faint objects in the source 
galaxy catalogue did not include a detailed analysis of in- 
strumental properties as it was carried out by Hoekstra 
et al. (1997). However, typical distortion values in the field 
we analysed are considerably larger than their instrumental 
corrections and so we believe that the implications on our 
results are negligible. 

To obtain the results shown so far we chose r\ — 4 for the 
index of the scaling law that relates the velocity dispersion 
parameter of galaxies with different luminosities to those 
of an L*-galaxy. This choice was motivated by the observed 
Faber- Jackson relation for the velocity dispersion of the stars 
in elliptical galaxies. If we assume a roughly isothermal mass 
density profile for the galaxies a similar relation should also 
hold for the velocity dispersion parameter a of their dark 
matter halo. In Fig. ^| we explored the sensitivity of the 
result to the choice of the scaling index. The figure shows 
confidence contours in the parameter space defined by the 
velocity dispersion cr* and the scaling index r\. The value 
of rj = 4 lies within the 68% confidence contour and the 
likelihood difference compared to the maximum of the like- 
lihood function is insignificant. We consider this figure as 
a necessary check if the information provided by lensing is 
consistent with our observationally motivated assumptions, 
and not as a serious attempt to constrain the parameter r\ 
from the scarce data available here. For this plot the value 
for the cut-off radius parameter was fixed at = 8/t -1 kpc, 
which corresponds to the maximum of the likelihood func- 
tion in Fig. |^ Changing s* favours different values for the 
velocity dispersion parameter and leads to a shift of the con- 
tours along the ov-coordinate in Fig. ^| However, this does 
not affect the conclusions regarding the scaling index that 
we made above. Another parameter whose importance for 
our results we did not yet discuss is the index v for the cut- 
off radius scaling law. In contrast to the parameter ij we do 
not have a sound motivation for a particular choice of this 
index. Changing this parameter between v = and v = 1 
only leads to marginal differences in the likelihood and so it 
is impossible to achieve limits on this parameter from these 



data. In addition, the implications on the appearance of the 
confidence contours in Figs. |^ to |i| for which v — 0.5 was 
assumed, are very small. 

We now turn to a discussion of the absolute likelihood 
level of our reconstructed solutions. In order to determine 
the range of values for I that indicate statistical consistency, 
we generate random intrinsic ellipticities for the objects in 
our source galaxy catalogue. Taking the reconstructed so- 
lution for the cluster we then calculate their lensed image 
ellipticities and from that the value of the logarithm of the 
likelihood. Repeating that procedure for many different in- 
trinsic realizations allows to construct a histogram for p(l) 
corresponding to those shown in Fig. ^ for the simulations. 
As a result we obtain Z-values approximately between 260 
and 300. However, the 6x6 grid reconstruction without 
galaxies presented in Fig. [] only reaches a log-likelihood 
level of I — 138 for the actually observed image ellipticities, 
which is far below the value tolerated for an acceptable solu- 
tion. Including cluster galaxies allows to increase this value 
by 5 or 6, but refining the cluster grid is much more effec- 
tive in this respect. The 12 x 12 grid solution also shown 
in Fig. ^| attains a value as high as I — 172, which how- 
ever still falls short of the expected level by a large amount. 
(The expected level as determined above does not depend 
very much on the choice for the number of grid points.) 
In theory the number of grid points and the regularization 
strength should be adjusted such that consistent likelihood 
values are reached. This works quite well for our application 
to simulated observations described in Section |§] where we 
assumed that the probability density distributions for the 
intrinsic ellipticity and the redshift of the source galaxies 
are known. For the application to real data discussed here, 
however, this is not the case and probably the problem that 
we face here - and that is reflected by the discrepant log- 
likelihood levels - is not an inadequate description of the 
cluster mass distribution, but rather an insufficient knowl- 
edge of the properties of the faint galaxies that we have used 
as lensed sources for our study. In particular we made sim- 
ple assumptions regarding the redshift distribution and the 
intrinsic ellipticity distribution of these galaxies. In practice 
these could be quite different for galaxies of different mag- 
nitude, surface brightness or morphology. In order to use 
a check of the absolute likelihood level as a powerful tool 
for evaluating the reliability of our description of the clus- 
ter mass distribution, a detailed understanding of the source 
galaxy population is warranted. 



5 DISCUSSION 

Continuing our discussion of Paper I we investigated meth- 
ods to constrain the mass distribution of cluster galaxies by 
weak lensing. In Paper I we concluded that a comprehensive 
method should simultaneously take into account the lensing 
effects of individual cluster galaxies as well as those of a 
general cluster component. To this end we developed a reg- 
ularized cluster lens reconstruction algorithm that directly 
uses the image ellipticities of individual background galaxies 
as observables, and that allows to explicitly include the pres- 
ence of cluster galaxies in the analysis. We parametrize the 
mass distribution of cluster galaxies by truncated isother- 
mal spheres and apply simple scaling laws to relate the halo 
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properties to those of L*-galaxies according to the galaxy 
luminosity. For each value of the galaxy parameters - ve- 
locity dispersion <r* and cut-off radius s* of an L+-galaxy - 
the method allows to determine the best representation for 
the underlying cluster mass component that is consistent 
with the observed image ellipticities. We tested the method 
on simulations and achieved convincing and robust results. 
If the dark matter distribution around cluster galaxies is 
rather extended, it is not straightforward to make a clear-cut 
distinction between mass belonging to a global cluster com- 
ponent or to galaxy haloes. An important result from these 
simulations is that the distance between the grid points of 
the cluster component determines the scale for a somewhat 
artificial separation of the mass located in galaxies and the 
mass attributed to the global cluster component. 

We applied our method to a WFPC2 image of the 
galaxy cluster C10939+4713. Our reconstruction of the clus- 
ter mass distribution is consistent with the result of SKSS 
who used the same object catalogue but a different recon- 
struction technique. However, the robustness of the result 
should finally be verified by using observations with a larger 
field of view. This would also allow to break the mass sheet 
degeneracy that prevents us from deriving absolute mass es- 
timates here. In this respect another factor of uncertainty is 
the unknown redshift distribution of the source galaxies. 

The galaxy-galaxy lensing analysis provided a signifi- 
cant detection (approximately at the 99% to 99.9% confi- 
dence level) of the lensing effects induced by luminous ellip- 
tical cluster galaxies. In fact it had already been speculated 
by Dressier et al. (1994b) that some of the arclets in this 
WFPC2 image 'appear to be background sources lensed by 
individual galaxies rather than the overall cluster potential'. 
The likelihood function attains its maximum at rather low 
values for the cut-off radius parameter. However, the small 
number of lens galaxies included in our study does not al- 
low to derive strong limits for the parameters describing 
the galaxy haloes. Changing the assumptions that we were 
obliged to make regarding the mass sheet degeneracy and 
the redshift distribution of the sources leads to a rescaling 
of the galaxy-galaxy lensing result. For this reason we do not 
quote galaxy mass-to-light ratios or values for the fraction 
of the total mass bound to galaxies in this paper. Our result 
is consistent with the result of Natarajan et al. (1997) who 
applied the methods of Natarajan & Kneib (1997) to analyse 
a mosaic of WFPC2 images of the cluster AC114. However, 
significant differences of their approach compared to ours 
make a quantitative comparison of the results difficult. 

We consider the application to C10939+4713 described 
in this paper as a successful test of our maximum likeli- 
hood technique for galaxy-galaxy lensing in clusters of galax- 
ies. The problems we encountered are not inherent to our 
method and mainly resulted from the small size of the im- 
age we analysed. Ideal observations for this project should 
include deep exposures with high image quality and large 
fields of view, which would make it feasible to test possi- 
ble variations of the halo properties with the density of the 
environment. Another important ingredient should be sup- 
plementary imaging in different colours in order to assure a 
reliable selection of cluster galaxies and potentially for de- 
riving photometric redshift estimates for the galaxies used 
as sources. The benefit of photometric redshift information 
for galaxy-galaxy lensing studies (in the field) was empha- 



sized by Schneider & Rix (1997) and has been demonstrated 
by Hudson et al. (1997) in their analysis of the Hubble Deep 
Field. Making use of available redshift information for indi- 
vidual source galaxies also allows us to reduce the noise in 
cluster reconstructions. [In our implementation the approx- 
imation (Q) for the probability density distribution of image 
ellipticities can then be replaced by the simpler expression 
(^).] This was also recognized by Seljak (1997) in a similar 
context. 

The analysis of the kind of observational data sets en- 
visaged above, which are most likely to be available within 
the next few years, will provide the opportunity to com- 
pare the lensing constraints to recent numerical studies of 
the properties of galaxy haloes within clusters (Tormen, Di- 
aferio & Syer 1997, Ghigna et al. 1998). In addition, a com- 
parison and combination of the galaxy-galaxy lensing results 
with morphological studies of the galaxy population in these 
clusters (e.g. Oemler, Dressier & Butcher 1997) could help 
to obtain observational evidence for the physical processes 
[such as the 'galaxy harassment' picture suggested by Moore 
et al. (1996)] that might be responsible for the evolution of 
cluster galaxies. 
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APPENDIX A: 
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ainp £ (e) 71 d\np e (e) 72 oo 

<9|7o°l l7oo| d(j) 1 |7oo| 5 

dlnp e (e) 72oo ( d\np t (e) 71 
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dip a 13 
d-lpa/3 



The symbol 4>~t represents the phase of the complex shear 
parameter 700 = |7oo|e I ^' T . The derivative of lnp e (e) with 
respect to (too is given by 



d\np e (e) 



Cl 



el 



dg r 



8k, 
da i2 (g T ) 



°"si(5r) + (ei - g r ) 



^l(gr) 



dvti (9r) 



a ii (Si) dn 



ai 2 (g r ) dn a 



a similar expression holds for <91np £ (e) /d\-yoo\, and the 
derivative with respect to 7 is 

dimple) _ (ei - gr) h e 2 h 
d<h-, a? cr? 



In this appendix we prove a statement given in Section 2.2 
concerning the dispersion of the probability density distribu- 
tion for the image ellipticity of lensed background galaxies. 
Seitz & Schneider (1997) showed that the moments of the 
complex ellipticity parameter e can be expressed in terms of 
the reduced shear: 



(e")^=g" for \g\ < 1 



(Al) 



In the following we perform a rotation of the ellipticity coor- 
dinates such that the imaginary component of the (reduced) 
shear vanishes, i.e., we transform g and e into 'local shear 
coordinates'. This can be expressed as 



2_ 



and 



We calculate the expectation value of e 2 
by applying equation (Al): 

(? 2 k = (ei)e s - (elk + 2(ei h). 3 i = f ■ 



2_ 

el- 62 + 2ei? 2 i 



(A2) 



The quantity g is real and so it follows from this equation 
that (ei ?2}e b = 0. Now we can express the variance of <L\ as 



2 



2 
°l2 



In the first equality we used equation (Al) and the second 



follows from equation ( |A2| ) with the definition of a s := 

(el)e a — (£2} 2 S and (e2)e a = which again follows from ( Al) 



Thus the dispersion of the probability density distribution 
in local shear direction is equal to the dispersion in the per- 



pendicular direction: ai 1 
be derived analogously. 



For 



> 1 this result can 



The derivatives remaining in these formulae are trivial to 
calculate from the definitions of the respective quantities. 
The derivative of the term that ensures the correct total 
mass reads 



G>lnp« c 



7,0 = 1 



and the entropy term yields 
OS dS OkcjS 



with 



E 

7,5=1 

1 



8kc-/S dip a p 



OS dkcec 9kc 1 s 

^ OkceC 9kc 7 5 dlp a 
7,<5=1 e,C=l 



E 



and 



dS 



Although these expressions look rather complicated they can 
be conveniently implemented technically and do not cause 
major computational problems. 



APPENDIX B: 

This appendix gives the expressions needed to calculate the 
derivatives of / with respect to the potential values on the 
grid points. The derivative of the logarithm of the probabil- 
ity density can be written as 

*91np e (e) dlnp e (e) dK x 

dlp al 3 SKoo dlpaf) 
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